MEMORY ACCESS METHOD FOR VIDEO DECODING 



BACKGROUND OF THE INVENTION 

Field of the invention 

The invention relates to a memory access method, and in particular to a memory 
5 access method in which a picture is divided into several macroblocks and stored in the 
memory block by block, and the top and bottom field of each macroblock are separately 
stored in different consecutive addressing areas of pages. According to this method, 
the DRAM burst mode can be utilized for both frame and field access to substantially 
increase effective DRAM bandwidth. In this way, the drawbacks in both the 
10 frame-organized method and field-organized method can be avoided. 

Description of the Related Art 

In current video decoding systems, such as MPEG-I, MPEG-II, H.261; and the 
like, the inter-frame compression technology is often used to reduce the data 
redundancies between pictures so as to obtain a better data compression result. The 

1 5 systems all involve algorithms based on a common core of the compression technology, 
such as predictive and/or inter-polative inter-picture encoding. The motion 
compensation is block-based, and each prediction block has associated motion vectors. 
The motion compensation operation involves reading the prediction block from a 
reference picture according to the motion vectors. A reference picture is used for 

20 forward and backward prediction. The reference picture can be either a reference- 
frame-picture, or a reference-field-picture having one reference-top-field-picture and 
one reference-bottom-field-picture. A reference-frame-picture is encoded as a frame 



picture, and contains lines of spatial information of a video signal. These lines of spatial 
information contain samples starting from the uppermost of the picture continuing 
through successive lines to the downmost of the picture. A reference-field-picture is 
encoded as a field picture, and is the assembly of alternate lines of a frame. Each line of 
5 a reference-top-field-picture is spatially located immediately above the corresponding 
line of the reference-bottom-field-picture. The reference pictures are quite large and 
typically stored in d3aiamic random access memories (DRAMs). The DRAM has 
several memory banks, and each bank includes many memory pages. However, there 
is a problem of cross-page penalty in the DRAM. The DRAM can be addressed by the 

10 column address and the row address, and a different row address is addressed to a 
different page. Before reading the data in different pages, it is necessary to activate 
and pre-charge the to-be-read page. Consequently, reading the data from different 
pages may slow down the speed for the video decoder to read the reference picture. In 
order to mitigate the cross-page penalty, it is desired to store each prediction block into 

1 5 fewer pages during data accessing, and then to access data in a page-by-page manner. 

The increasing demand for more memory bandwidth requires the use of 
sophisticated memory devices like DDR SDRAM or RDRAM (Rambus DRAM). To 
obtain high performance, these devices have two main features: the burst-access mode 
and the multiple-bank architecture. The burst-access mode enables access to a number 
20 of consecutive addressing data words by simply giving a burst read command or a burst 
write command. The burst read command or burst write command includes a 
burst-starting-address indicating the starting address of the data to be retrieved, and a 
burst-access-length indicating the size of the data to be retrieved. The multiple-bank 
architecture has the feature that each bank can be accessed alternately. A burst access 



2 



operation majorly includes three steps. Firstly, issue a row-activation command so as 
to copy the row data of a designated bank into the sense amplifier. Secondly, issue a 
burst access command to fetch data of the size indicated in the burst-access-length from 
the sense amplifier and then send the fetched data to the data bus. Thirdly, access data 
5 stored in other columns but in the same row by changing only the column address 
without issuing any additional row-activation command. Since these steps are 
pipelined with an external clock, SDRAM or RDRAM can operate at higher clock 
frequency, and commands issued to different banks can be overlapped. For example, a 
row-activation command of one bank can be overlapped with burst access operations of 
10 another bank. So the number of cycles for the row-activiation can be reduced. 
Therefore, if we can access the reference picture by burst access mode, the effective 
DRAM bandwidth can be greatly boosted up. 

In a typical video decoding system, a reference picture has two fields, i.e., top 
field and bottom field. The top field includes all the odd number data rows in the 

15 reference picture, while the bottom field includes all the even number data rows in the 
reference picture. The top field and the bottom field can be samples retrieved at the 
same or different time instant in the time axis. To achieve the best compression ratio 
and reconstructed image quality, frame-based prediction and field-based prediction 
mode are used. The frame-based prediction is a prediction mode using both the top 

20 field and bottom field of the reference picture, and the field-based prediction is a 
prediction mode using only one field of the reference picture. In the case that the top 
and bottom fields of the frame are sampled at a same time instant, frame-based 
prediction is used. In the case that the top and bottom fields of the frame are sampled 
at a different time instant, field-based prediction is used. In another aspect, since the 
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reference picture can be divided into many macroblocks, each macroblock includes a 
portion of the top field and a portion of the bottom field. The specific portion of the 
top field belonging to a macroblock will be described as "the corresponding top field of 
the macroblock" in the following description. Similarly, the specific portion of the 
5 bottom field belonging to a macroblock will be described as "the corresponding bottom 
field of the macroblock." 

To perform the motion compensation operation in a fi-ame-based prediction, the 
frame access method that concurrently accesses both the top field and bottom field is 
used. On the other hand, to perform the motion compensation in a field-based 
10 prediction, the field access method that accesses either the top field or the bottom field 
is used. 

FIG. 1 is a diagram showing a conventional arrangement wherein the reference 
picture is stored in the memory by a firame-organized storage method. In the 
firame-organized storage method, a reference-firame-picture is directly stored in a firame 

15 buffer 1. On the other hand, the reference-top-field-picture and the 
reference-bottom-field-picture are interlaced and combined as a firame structure, and 
then stored into a firame buffer 1. FIG. I illustrates a page 10 stored in a firame buffer 1 . 
Please note that in this drawing, each address is directed to double words (4 bytes) of 
data. The picture has 720*576 pixels, and contains 720*576 bytes of data if each pixel 

20 is represented by one byte of data, so the firame buffer 1 contains 720*576 bytes of data. 
Each page has 32*32=1,024 bytes of data, and includes four 16*16 bytes macroblocks 
11, 12, 13 and 14. As described above, each macroblock includes a portion of the top 
and bottom fields, wherein the shadowed areas denote the bottom field in this drawing. 
There are drawbacks for this conventional firame-organized storage method. For 



example, for accessing the top or bottom field in a field-based prediction, the 
above-mentioned DRAM burst-access mode can't be used since the accessed addresses 
are not consecutively located. For example, if we want to read the top field of the 
macroblock 11 in FIG.1, the read addresses are not consecutively located, we can't use 
5 DRAM burst access mode to read these data. Hence, the effective DRAM bandwidth 
will be substantially decreased. 

FIG. 2 is a diagram showing a conventional arrangement wherein the reference 
picture is stored in the memory by a field-organized storage method. In the 
field-organized storage method, the reference-top-field-picture and the 

10 reference-bottom-field-picture are respectively stored in different field buffers. On the 
other hand, the reference-firame-picture is composed of a reference-top-field-picture and 
a reference-bottom-field-picture, which are stored in different field buffers. Also, 
assuming the picture has 720*576 pixels, and contains 720*576 bytes of data if each 
pixel is represented by one byte of data, each of the field buffers 2 and 2' has 720*288 

1 5 bytes of data. The field buffers 2 and 2' are different portions of a memory. The page 
20 has 32*32 bytes of data, and stores eight portions of the top field, indicated as 21 to 
28 in FIG. 2(A); the page 20' has also 32*32 bytes of data, and stores eight portions of 
the bottom field, indicated as 21' to 28' in FIG. 2(B). In the field-organized method, 
since the top and bottom fields are stored in different field buffers 2, 2', the 

20 corresponding top and bottom fields of each macroblock will be stored in different 
pages. Although the field-organized method can avoid the drawbacks of the 
firame-organized method, another disadvantage occurs in the frame access of a 
fi:ame-based prediction. When a system perform the fi-ame access in a frame-based 
prediction so as to fetch both the top and bottom field data, cross page penalties will 
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occur since the top and bottom fields of each macroblock are stored in different pages. 
Hence, the effective DRAM bandwidth of field-organized method will be lowered due 
to the inevitable cross page penalties of frame access. 



SUMMARY OF THE INVENTION 

5 In view of the above-mentioned problems, it is therefore an object of the invention 

to provide a memory access method for video decoding in which a reference picture is 
stored in a memory in the unit of macroblocks, and the corresponding top and bottom 
field of each macroblock are separately stored in different consecutive addressing areas 
of pages. According to this method, a system can fully utilize DRAM burst mode for 
10 both frame and field access to substantially increase effective DRAM bandwidth. 
Furthermore, we can avoid inefficient DRAM access in the frame-organized method, 
and avoid undesirable cross-page penalty in frame access in the field-organized method. 

To achieve the above-mentioned objects, a new method of storing pixel data of a 
reference picture and retrieving a prediction block of the reference picture from a 

15 memory is provided. According to the preferred embodiments, the memory is divided 
into several pages. The reference picture consists of a top field and a bottom field, and 
is divided into several macroblocks. Each macroblock has a corresponding portion of 
the top field and the bottom field of the reference picture. The method includes the 
steps of: dividing each page into a top-section having one or more consecutive 

20 addressing area and a bottom-section having one or more consecutive addressing areas; 
storing the corresponding top field of the macroblocks into the top-sections, and the 
corresponding bottom field of the macroblocks into the bottom-sections; and then 
retrieving the pixel data of the prediction block stored in the memory in a page-by-page 
manner. 

25 
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BRIEF DESCRIPTION OF THE DRAWINGS 

FIG 1 is a diagram showing a conventional arrangement wherein the reference 
picture is stored in the memory in the unit of macroblocks. 

FIG 2 is a diagram showing another conventional arrangement wherein the 
reference picture is stored in the memory in the unit of macroblocks. 

FIG. 3 is a schematic illustration showing the reference picture divided in the unit 
of macroblocks according to the first embodiment of the present invention. 

FIG 4 is a diagram showing an arrangement wherein the reference picture is 
stored in the memory in the unit of macroblocks according to the first embodiment of 
the invention. 

FIG. 5A is a schematic illustration showing the corresponding memory addresses 
when the reference picture is stored in the memory in the unit of macroblocks according 
to the first embodiment of the invention. 

FIG 5B is a schematic illustration showing the corresponding memory addresses 
when the reference picture is stored in the memory in the unit of macroblocks according 
to the first embodiment of the invention, wherein the addressing sequence is different 
from that in FIG 5A. 

FIG 6A is another schematic illustration showing the memory addressing areas of 
the top and bottom field data in a page according to the second embodiment of the 
invention. 

FIG 6B is another schematic illustration showing the memory addressing areas of 



the top and bottom field data in a page according to the second embodiment of the 
invention, wherein the addressing sequence is different from that in FIG. 6A. 

DETAILED DESCRIPTION OF THE INVENTION 

The memory access method for video decoding of the preferred embodiments will 
5 be described with reference to the accompanying drawings. Although the MPEG format 
is described as an example of the memory access method of the invention, the scope of 
the invention is not limited thereto. 

FIG. 3 illustrates the pixel data of a reference picture of 720*576 pixels divided 
into many macroblocks. Assuming each pixel is represented by one byte of data, and 

10 then the reference picture 3 contains 720*576 bytes of data. In this embodiment the 
size of each macroblock is 16*16 pixels according to the definition of MPEQ so the 
data quantity of each macroblock is 256 bytes, and each macroblock includes a 
corresponding portion of the top field and the bottom field of the reference picture. 
Consequently, the reference picture is divided into 45*36 macroblocks, which are 

15 respectively denoted as MBO to MB 161 9 in this drawing. Then, the number of 
macroblocks stored in each page is calculated according to the size of the page. 
Assuming the size of each page is 1,024 bytes in this embodiment, four macroblocks 
can be stored in one page since the size of each macroblock is 256 bytes. 

After the number of macroblocks stored in each page of the memory is calculated 
20 and obtained, we store neighboring macroblocks in the reference picture into the page. 
In this embodiment, four macroblocks can be stored in one page, so every four (2*2) 
macroblocks may be regarded as one set of macroblocks for being sequentially stored 
into each corresponding page. The overall 1,620 macroblocks can be grouped as 405 
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sets of macroblocks, and can be stored into 405 pages of the memory. 

FIG. 4 illustrates the 405 sets of macroblocks are stored into the 405 pages of a 
buffer 4 according to one embodiment of the present invention. The buffer 4 is a 
portion of a memory, such as DDR SDRAM or RDRAM. Each page of the memory is 
5 divided into a top-section and a bottom-section. The top-section includes several 
consecutive addressing areas, and so is the bottom-section. The corresponding top 
field of the macroblocks belonging to this page is stored into the consecutive addressing 
areas of the top-section, and the corresponding bottom field of the macroblocks 
belonging to this page is stored into the consecutive addressing areas of the 
10 bottom-section. 

As shown in FIG. 4, the corresponding top and bottom field of the four 
macroblocks of each page are separated, and respectively stored in the top-sections TSO 
to TS404 and the bottom-sections BSO to BS404. The shadowed areas shown in FIG. 4 
denote the bottom-sections for the bottom fields. 

15 FIG. 5 A is a schematic illustration for further illustrating the memory addressing 

areas of the top and bottom sections in a page. The page is divided into a top-section 
TSO and a bottom-section BSO. The top-section TSO contains a consecutive 
addressing area CA2_0, and the bottom-section BSO contains a consecutive addressing 
area CA2_1 . In this drawing, each address is a byte address. That is, each address is 

20 directed to one byte of data. The addressing area CA2_0 contains the addresses from 
OOOH to IFFH, and the addressing area CA2_1 contains the addresses from 200H to 
3FFH. In each of the consecutive addressing areas, the available addresses are 
consecutive. The corresponding top field of the four macroblocks for each page is 
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stored in the consecutive addressing area CA2 0 of the top-section TSO, and the 
corresponding bottom field of the four macroblocks is stored in the consecutive 
addressing area CA2_1 of the bottom-section BSO (indicated as shadowed areas in FIG 
5A). Assuming the data bus width in the system is 32 bits (double word) and each 
5 storage unit shown in the drawing is also set to be double words (4 bytes), then the page 
PO stores the data of the corresponding top field of the macroblock MBO into the 
addresses of OOOH^OIFH, 040H'-05FH, 080H~09FH and OCOHM)DFH, and stores the 
data of the corresponding bottom field of the macroblock MBO into the addresses of 
lOOH-'llFH, 240H-'25FH, 280H--29FH and 2C0H'-2DFH. The addressing areas of 

10 the corresponding top and bottom field of MB 1, MB 45, and MB 46 is also illustrated 
in FIG 5A, accordingly. However, the addressing areas for storing the data of the 
corresponding top and bottom fields of the four macroblocks into a page is not limited 
to the above addressing sequence. Another addressing sequence, such as the one 
shown in FIG 5B, is also feasible. Other addressing sequences which store the 

15 corresponding top field or bottom field of the macroblocks belonging to a page into the 
top-section or the bottom-section may also be adopted here. For example, as shown in 
FIQ 5B, the page PO can also store the data MBO', MB45', MBT and MB46' of the 
corresponding top field of the macroblocks MBO, MB45, MBl and MB46 into the area 
of 000H^7FH, OSOH'-OFFH, 100H--17FH, and 180H--1FFH, respectively, and store the 

20 data MBO", MB45", MBl" and MB46" of the corresponding bottom field of the 
macroblocks MBO, MB45, MBl and MB46 into the area of 200H--27FH, 280H~2FFH, 
aOOH'-STFH, and SSOH-^SFFH, respectively. 

After the method of storing the pixel data of a reference picture into a memory is 
described, we now illustrate the method of retrieving the prediction blocks of the 
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reference picture in this embodiment. The dashed rectangles shown in FIG 3 indicate 
the positions of several prediction blocks. In an example, there are two prediction 
blocks PBl and PB2 in Fig. 3 and Fig. 4. The prediction block PBl has a size of 
16*33 bytes, and the prediction block PB2 has a size of 16*16 bytes. The procedure of 
5 reading pixel data of the prediction blocks PBl and PB2 according to this embodiment 
will be described herein. 

Firstly, we illustrate the field access method in this embodiment by reading only 
the top field of the prediction block PB 1 . The to-be-read areas include sub-areas SB 1 
and SB2, as shown in FIG. 4. Thus, the pages that are to be read when retrieving the 

10 pixel data of the prediction block PBl include pages PO, PI, and P24. As shown in the 
drawing, the sub-area SBl is distributed over the pages PO and PI, while the sub-area 
SB2 is distributed over the page P24. The data belonging to different pages are 
sequentially read in a page-by-page manner in this embodiment so as to reduce the 
number of cross-pages. That is, the data of the sub-area SBl within the page PO is 

15 read first. Then, the data of the sub-area SBl within the page PI is read. Finally, the 
data of the sub-area SB2 within the page P24 is read. It should be noted that the 
sequence of reading these different pages may also be varied according to the spirit of 
the present invention. 

Compared with the conventional firame-organized method, since the top fields of 
20 the macroblocks are consecutively addressed in this embodiment, we can easily use 
DRAM burst access mode to burst access these top field by DRAM burst access. 
Therefore, the effective DRAM bandwidth can be greatly increased. The above 
description illustrates accessing the top field of the prediction block using the DRAM 
burst access mode according to one embodiment. Similarly, the DRAM burst access 



mode can also be used to burst access the bottom field of the prediction block. Details 
will not be described for brevity. 

Secondly, we illustrate the frame access method in this embodiment by reading 
both the top field and the bottom field of the prediction block PB2. The to-be-read 
5 areas include the sub-areas SB3 to SB6, as shown in FIG. 4. Thus, the pages that are to 
be read in this example include pages P3, P26 and P27. The sub-areas SB3 and SB4 
are distributed over the page P3, while the sub-areas SB5 and SB6 are distributed over 
the pages P26 and P27. The data belonging to different pages are sequentially read in 
a page-by-page manner so as to reduce the number of cross-pages. That is, the data of 
10 the sub-areas SB3 and SB4 within the page P3 are read first in this embodiment. Then, 
the data of the sub-areas SB 5 and SB6 within the page P26 are read. Finally, the data 
of the sub-areas SB5 and SB6 within the page P27 are read. 

Compared with the conventional field-organized storage method, since the top 
field and bottom field of each macroblock are stored in the same page in this 

15 embodiment, the number of cross-pages can be substantially reduced since there will be 
no cross-page penalties when it is required to read both the top and bottom field of the 
same macroblock. Furthermore, since the top field or bottom fields of a macroblock 
are consecutive addressed in this embodiment, we can easily use DRAM burst access 
mode to burst access these top field or bottom field by DRAM burst access. Therefore, 

20 the effective DRAM bandwidth can be increased. 

FIG. 6A illustrate another embodiment of the memory addressing areas of the top 
and bottom field in a page according to the present invention. In the first embodiment 
as shown in FIG. 5A, each page of the memory is divided into a top-section TSO having 
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one consecutive addressing area CA2_0, and a bottom-section BSO having one 
consecutive addressing area CA2_1. In the second embodiment as shown in FIG 6A, 
a page PO is divided into a top-section TSO having two consecutive addressing areas 
CA4_0, CA4_1, and a bottom-section BSO having two consecutive addressing areas 
5 CA4_2, and CA4__3. The consecutive addressing area CA4_0 contains the addresses 
from OOOH to OFFH; the consecutive addressing area CA4_1 contains the addresses 
from 200H to 2FFH; the consecutive addressing area CA4_2 contains the addresses 
from lOOH to IFFH; and the consecutive addressing area CA4_3 contains the addresses 
from 300H to 3FFH. In each of the consecutive addressing areas, the available 

10 addresses are consecutive. The data of the corresponding top field of the four 
macroblocks are stored into the two consecutive addressing areas CA4_0 and CA4_1 of 
the top-section TSO, and the data of the corresponding bottom field of the four 
macroblocks are stored into the two consecutive addressing areas CA4_2, and CA4_3 of 
the bottom-section BSO. More particularly in this embodiment, the data of the 

15 corresponding top field of MBO and MB45 are stored into the consecutive addressing 
area CA4_0 of the top-section TSO, and the data of the corresponding top field of MBl 
and MB46 are stored into the consecutive addressing area CA4_1 of the top-section TSO. 
The data of the corresponding bottom field of MBO and MB45 are stored into one 
consecutive addressing area CA_4_2 of the bottom-section BSO, and the data of the 

20 corresponding bottom field of MBl and MB46 are stored into the consecutive 
addressing area CA4_3 of the bottom-section BSO. However, the addressing areas for 
storing the data of the corresponding top and bottom fields of the four macroblocks into 
a page is not limited to the above addressing sequence. Another addressing sequence, 
such as the one shown in FIG 6B, is also feasible. 
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In the above embodiments, the top-section comprises a same number of 
consecutive addressing areas as of the bottom-section. However, people skilled in the 
art will appreciate that the top-section and the bottom-section may comprise different 
numbers of consecutive addressing areas according to the spirit of the invention. For the 
5 sake of brevity, the further details will not be redundantly described here. 

While certain exemplary embodiments have been described and shown in the 
accompanying drawings, it is to be understood that such embodiments are merely 
illustrative of and not restrictive on the broad invention, and that this invention not be 
limited to the specific construction and arrangement shown and described, since various 
10 other modifications may occur to those ordinarily skilled in the art. 
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